Discrimination-Aware Classifiers for Student Performance Prediction
نویسندگان
چکیده
In this paper we consider discrimination-aware classification of educational data. Mining and using rules that distinguish groups of students based on sensitive attributes such as gender and nationality may lead to discrimination. It is desirable to keep the sensitive attributes during the training of a classifier to avoid information loss but decrease the undesirable correlation between the sensitive attributes and the class attribute when building the classifier. We illustrate, motivate, and solve the problem, and present a case study for predicting student exam performance based on enrolment information and assessment results during the semester. We evaluate the performance of two discriminationaware classifiers and compare them with their non-discriminationaware counterparts. The results show that the discriminationaware classifiers are able to reduce discrimination with trivial loss in accuracy. The proposed method can help teachers to predict student performance accurately without discrimination.
منابع مشابه
Automatic classification of highly related Malate Dehydrogenase and L-Lactate Dehydrogenase based on 3D-pattern of active sites
Accurate protein function prediction is an important subject in bioinformatics, especially wheresequentially and structurally similar proteins have different functions. Malate dehydrogenaseand L-lactate dehydrogenase are two evolutionary related enzymes, which exist in a widevariety of organisms. These enzymes are sequentially and structurally similar and sharecommon active site residues, spati...
متن کاملA Comparative Study of Ensemble Methods for Students Performance Modeling
Student performance prediction is a great area of concern for educational institutions to prevent their students from failure by providing necessary support and counseling to complete their degree successfully. The scope of this research is to examine the accuracy of the ensemble techniques for predicting the student's academic performance, particularly for four year engineering graduate p...
متن کاملApplication of ensemble learning techniques to model the atmospheric concentration of SO2
In view of pollution prediction modeling, the study adopts homogenous (random forest, bagging, and additive regression) and heterogeneous (voting) ensemble classifiers to predict the atmospheric concentration of Sulphur dioxide. For model validation, results were compared against widely known single base classifiers such as support vector machine, multilayer perceptron, linear regression and re...
متن کاملDiscrimination-Aware Association Rule Mining for Unbiased Data Analytics
A discriminatory dataset refers to a dataset with undesirable correlation between sensitive attributes and the class label, which often leads to biased decision making in data analytics processes. This paper investigates how to build discrimination-aware models even when the available training set is intrinsically discriminating based on some sensitive attributes, such as race, gender or person...
متن کاملProposing an Intelligent Monitoring System for Early Prediction of Need for Intubation among COVID-19 Hospitalized Patients
Introduction: Predicting acute respiratory insufficiency due to coronavirus disease 2019 (COVID-19) can diminish the severe complications and mortality associated with the disease. This study aimed to develop an intelligent system based on machine learning (ML) models for frontline clinicians to effectively triage high-risk patients and prioritize who needs mechanical intubation (MI). Material...
متن کامل